Abstract

Consider two urns, A and B, where initially A contains a large number $n$ of balls and B is
empty. At each step, with equal probability, either we pick a ball at random in A and place
it in B, or vice versa (provided of course that A, or B, is not empty). The number of balls in
B after $n$ steps is of order $\sqrt{n}$, and this number remains essentially the same after $\sqrt{n}$
further steps. Observe that each ball in the urn B after $n$ steps has a probability bounded away
from 0 and 1 to be placed back in the urn A after $\sqrt{n}$ additional steps. So, even though the
number of balls in B does not evolve significantly between $n$ and $n + \sqrt{n}$, the precise content
of urn B does.

This elementary observation is the source of an interesting two-time-scale phenomenon which
we illustrate using a simple model of fragmentation-coagulation. Inspired by Pitman's
construction of coalescing random forests, we consider for every $n \in \mathbb{N}$ a uniform random tree
with $n$ vertices, and at each step, depending on the outcome of an independent fair coin tossing,
either we remove one edge chosen uniformly at random amongst the remaining edges, or we
replace one edge chosen uniformly at random amongst the edges which have been removed
previously. The process that records the sizes of the tree-components evolves by fragmentation
and coagulation. It exhibits subaging in the sense that when it is observed after $k$ steps in the
regime $k \sim tn + s\sqrt{n}$ with $t > 0$ fixed, it seems to reach a statistical equilibrium as $n \to \infty$;
but different values of $t$ yield distinct pseudo-stationary distributions.

1 Introduction

Aging refers to systems that seem to reach a statistical equilibrium in a certain regime
depending on two time-scales: the age of the system and the duration of the observation. If the
observation scale is much shorter than the age of the system, then the effect is, more
precisely, called subaging. It arises in a variety of models in random media; see for instance
Ben Arous and Černý [4] and references therein. The purpose of this note is to point out that
subaging occurs in a process which evolves by fragmentation and coagulation, and is related
to the work of Pitman [9] on coalescent random forests. It has its source in an elementary
random urn dynamic which we now describe.

1.1 A two-time-scale phenomenon 

Consider two urns, say A and B, and assume that at the initial time A contains $n$ balls and B
is empty. At each step $k = 1, 2, \ldots$, we flip a fair coin. If heads comes up then we pick a ball in
A uniformly at random and place it in B (provided of course that A is not empty, else we do
nothing). Similarly, if tails comes up then we pick a ball in B uniformly at random and place it
in A (provided of course that B is not empty, else we do nothing). Fix $s, t > 0$ arbitrarily. By
the invariance principle, the numbers of balls in B after $\lfloor tn \rfloor$ steps and after
$\lfloor tn + s\sqrt{n} \rfloor$ steps are both close to $\sqrt{n} R_t$, where $R_t$ is a reflecting Brownian motion
evaluated at time $t$, i.e. has the distribution of the absolute value of an $\mathcal{N}(0, t)$-variable.
Precisely because B contains about $\sqrt{n}$ balls in that period, each ball in B after $\lfloor tn \rfloor$ steps
has a probability bounded away from 0 and 1 to be selected and placed back in A during the next
$\lfloor s\sqrt{n} \rfloor$ steps. In other words, even though the number of balls in B remains essentially
unchanged between $\lfloor tn \rfloor$ and $\lfloor tn + s\sqrt{n} \rfloor$ steps, the precise content of urn B evolves
significantly.
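The urn dynamic is easy to simulate, and a small experiment makes the two time-scales visible. The following is only a minimal sketch: the function names `run_urn_steps` and `urn_experiment`, the seed, and the choice to store each urn as a Python list are assumptions made for illustration. It runs $\lfloor tn \rfloor$ steps, records the content of B, runs $\lfloor s\sqrt{n} \rfloor$ further steps, and reports how many of the recorded balls are still in B.

```python
import math
import random


def run_urn_steps(urn_a, urn_b, steps, rng):
    """Perform `steps` moves in place: heads moves a uniform ball A -> B, tails B -> A."""
    for _ in range(steps):
        src, dst = (urn_a, urn_b) if rng.random() < 0.5 else (urn_b, urn_a)
        if src:  # do nothing when the source urn is empty
            j = rng.randrange(len(src))
            src[j], src[-1] = src[-1], src[j]  # swap-remove: uniform pick in O(1)
            dst.append(src.pop())


def urn_experiment(n, t, s, seed=0):
    """Return (|B| after floor(t n) steps, |B| after floor(s sqrt n) more steps, balls kept)."""
    rng = random.Random(seed)
    urn_a, urn_b = list(range(n)), []
    run_urn_steps(urn_a, urn_b, int(t * n), rng)
    before = set(urn_b)
    run_urn_steps(urn_a, urn_b, int(s * math.sqrt(n)), rng)
    after = set(urn_b)
    return len(before), len(after), len(before & after)


if __name__ == "__main__":
    size_before, size_after, kept = urn_experiment(n=40_000, t=1.0, s=1.0, seed=1)
    print(size_before, size_after, kept)
```

For large $n$ one expects the two sizes to be close relative to $\sqrt{n}$, while the number of balls kept is typically a fraction of the first size that is neither close to 0 nor close to 1.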

Now imagine a stochastic process governed by the content of urn B. Suppose that the
one-dimensional distributions of this process can be renormalized as $n \to \infty$ in such a way that
they have a non-degenerate limit, say $\mu_r$, when there are approximately $r\sqrt{n}$ balls in B. Let $n$
be large, $t > 0$ fixed and let $s$ vary. The process observed after $\lfloor tn + s\sqrt{n} \rfloor$ steps then
seems to be in statistical equilibrium, in the sense that its one-dimensional distributions do not
change much when $s$ increases. More precisely the almost equilibrium law can be expressed as the
mixture

$$\int_0^\infty \mu_r \, \mathbb{P}(R_t \in dr).$$

However this is only a pseudo-stationarity as this almost equilibrium distribution depends on
the parameter $t$.

The rest of this work is devoted to the rigorous analysis of this two-time-scale phenomenon
in the special case when the stochastic process alluded to above is a fragmentation-coagulation
process induced by a natural modification of Pitman's coalescing random forests [9].

1.2 Fragmentation-coagulation of random forests

Pitman considered the following random dynamics: one first picks a tree with $n$ labeled vertices
uniformly at random, and one deletes its edges one after the other, also uniformly at random.
At each step the tree-component containing the edge which is deleted splits into two smaller
trees, so after $k \le n - 1$ steps, we obtain a forest with $k + 1$ trees, and the process terminates
after $n - 1$ steps when all vertices are isolated. Pitman's motivation for considering these
edge-deletion dynamics stems from their connection with the additive coalescence. The latter
governs the evolution of a particle system in continuous time in which pairs of particles coalesce
with a rate proportional to the sum of their masses. More precisely, Pitman pointed out that
the process of the sizes of the tree-components in the forests resulting from the edge-deletion
dynamics is a Markovian fragmentation chain, and that time-reversal yields the discrete-time
skeleton of the additive coalescent started from $n$ monomers (i.e. $n$ atoms each having a unit
mass).

On the one hand, it is well-known from the work of Aldous [1] that the uniform random
tree on a set of $n$ vertices can be rescaled (specifically edges by a factor $1/\sqrt{n}$ and masses
of vertices by a factor $1/n$) and then converges weakly as $n \to \infty$ towards the Continuum
Random Tree (for short, CRT). On the other hand, Evans and Pitman [7] proved that the
additive coalescent started from a large number of monomers possesses a scaling limit, which
is known as the standard additive coalescent. Roughly speaking, Aldous and Pitman [2]
put the pieces together: they showed that the properly rescaled edge-deletion procedure on
finite trees converges weakly to a Poissonian logging of the CRT. The latter induces the CRT
fragmentation of masses, denoted here by $(F_t)_{t \ge 0}$, and in turn this yields the standard additive
coalescent upon time-reversal.

We consider in this work an evolution which combines edge-removal and edge-replacement,
and should be viewed as an avatar of the urn dynamic depicted in the first part. In this
direction it will be convenient to use marks on edges, agreeing that a mark on an edge means
that this edge has been removed, while the absence of mark means that this edge is present.
For each fixed $n$, we first pick a tree on $n$ vertices uniformly at random and declare that
initially all edges have no mark. At each step we flip a fair coin. If heads comes up then we
put a mark on one edge chosen uniformly at random amongst the un-marked edges (provided
of course that there still remains at least one edge without mark; else we do nothing), while
if tails comes up then we erase the mark of one edge chosen uniformly at random amongst the
marked edges (provided of course that there exists such an edge; else we do nothing). For
every integer $k \ge 0$, we denote by $T_k^{(n)}$ the random forest which results from removing the
marked edges after the $k$-th step and by $X_k^{(n)}$ the collection of the sizes of the tree-components
in $T_k^{(n)}$ rescaled by a factor $1/n$ and ranked in the decreasing order. In a technical jargon,
$X_k^{(n)}$ is a random mass-partition, that is a decreasing sequence of positive real numbers with
sum 1. Plainly the addition of a mark corresponds to a fragmentation event and erasure to a
coalescence; in other words the chain $(X_k^{(n)})_{k \ge 0}$ evolves by fragmentation and coagulation.
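The chain just described can be simulated directly. The sketch below is an illustration under stated assumptions rather than a construction taken from the text: the uniform labeled tree is generated by decoding a uniformly random Prüfer sequence (a standard device, not mentioned in this note), components are found with a small union-find, and all function names are hypothetical.

```python
import heapq
import random


def uniform_random_tree(n, rng):
    """Edges of a uniform labeled tree on {0, ..., n-1} (n >= 2), via a random Pruefer sequence."""
    seq = [rng.randrange(n) for _ in range(n - 2)]
    degree = [1] * n
    for v in seq:
        degree[v] += 1
    leaves = [v for v in range(n) if degree[v] == 1]
    heapq.heapify(leaves)
    edges = []
    for v in seq:
        edges.append((heapq.heappop(leaves), v))  # attach the smallest current leaf to v
        degree[v] -= 1
        if degree[v] == 1:
            heapq.heappush(leaves, v)
    edges.append((heapq.heappop(leaves), heapq.heappop(leaves)))
    return edges


def mass_partition(n, edges, marked):
    """Component sizes, divided by n and ranked decreasingly, after removing marked edges."""
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for idx, (u, v) in enumerate(edges):
        if idx not in marked:
            parent[find(u)] = find(v)
    sizes = {}
    for v in range(n):
        root = find(v)
        sizes[root] = sizes.get(root, 0) + 1
    return sorted((size / n for size in sizes.values()), reverse=True)


def fragmentation_coagulation(n, steps, seed=0):
    """Run the mark/erase chain for `steps` steps; return (mass partition, number of marks)."""
    rng = random.Random(seed)
    edges = uniform_random_tree(n, rng)
    unmarked, marked = list(range(n - 1)), []
    for _ in range(steps):
        # heads: mark a uniform un-marked edge; tails: erase a uniform mark
        src, dst = (unmarked, marked) if rng.random() < 0.5 else (marked, unmarked)
        if src:  # do nothing if no edge of the required kind exists
            j = rng.randrange(len(src))
            src[j], src[-1] = src[-1], src[j]
            dst.append(src.pop())
    return mass_partition(n, edges, set(marked)), len(marked)
```

Since removing $m$ edges from a tree always leaves $m + 1$ components, the returned partition has exactly one more term than the number of marks, which gives a convenient sanity check on the simulation.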

Our purpose is to investigate the asymptotic behavior in distribution of $X_k^{(n)}$ as $k, n \to \infty$.
This is easy as far as only one-dimensional distributions are concerned. Indeed Donsker's
invariance principle implies that when $n$ is large, the number of marks after $\lfloor tn \rfloor$ steps is
about $\sqrt{n} R_t$, where $(R_t : t \ge 0)$ is a reflected Brownian motion. It then follows from results in
Aldous and Pitman [2] that for every fixed $t > 0$, $X^{(n)}_{\lfloor tn \rfloor}$ converges weakly as $n \to \infty$ towards
$F_{R_t}$, the CRT mass-fragmentation observed at the independent random time $R_t$.

It turns out that things are more subtle for finite-dimensional marginals. The simple rescaling
of times by a factor $n$ is too crude and the asymptotic behavior in law is better revealed in
the finer regime $tn + s\sqrt{n}$. Indeed our main result says that for every fixed $t > 0$, the process
$(X^{(n)}_{\lfloor tn + s\sqrt{n} \rfloor})_{s \in \mathbb{R}}$ converges weakly in the sense of finite-dimensional marginals towards some
non-degenerate stationary process with stationary law given by the distribution of $F_{R_t}$. Thus
the fragmentation-coagulation process $X^{(n)}$ exhibits subaging, in the sense that this process
seems to reach a statistical equilibrium in the regime $tn + s\sqrt{n}$ when $n$ is large and $t$ fixed.

The plan of the rest of this note is as follows. Our main result is stated and proved in Section
2 after recalling some notions on the CRT. Finally Section 3 is devoted to some comments,
complements and open questions. Our approach owes much to the construction by Aldous and
Pitman of the standard additive coalescent via Poissonian cuts on the skeleton of a Continuum
Random Tree.

2 Main result 

We start by recalling some elements on the CRT, referring the reader to Aldous [1], Evans [6]
and Le Gall [8] for background, and then state our main result.

Let $\mathcal{T}$ be a Brownian CRT; in particular $\mathcal{T}$ is almost surely a compact metric space which has
the structure of a real tree. Extremities of $\mathcal{T}$ are called leaves; in other words $a \in \mathcal{T}$ is a leaf
means that if $a$ lies on some path joining two points $b, c$ in $\mathcal{T}$, then necessarily $a = b$ or $a = c$.
The subset of leaves is totally disconnected; its complement is referred to as the skeleton. One
endows $\mathcal{T}$ with a probability measure $\mu$ carried by the subset of leaves and with a sigma-finite
length measure $\lambda$ carried by the skeleton. More precisely the distance between two points in $\mathcal{T}$,
say $a, b$, is given by $\lambda([a, b])$, where $[a, b]$ stands for the path joining $a$ to $b$ in $\mathcal{T}$.

We next introduce marks on the skeleton of $\mathcal{T}$ that appear and disappear randomly as time
passes at some constant rate. Specifically, we fix a parameter $r > 0$ and introduce a Poisson
point process on $\mathbb{R} \times (0, \infty) \times \mathcal{T}$ with intensity

$$\frac{1}{2}\, ds \otimes \frac{1}{2r}\, e^{-u/(2r)}\, du \otimes d\lambda.$$

An atom $(s, u, x)$ should be interpreted as follows: at time $s$ a mark appears at location $x$
on the skeleton and is erased at time $s + u$. In words, on any portion of the skeleton with
length measure $\ell$, marks appear with rate $\ell/2$ and the lifetime of each mark is exponentially
distributed with expectation $2r$, independently of the other marks. For every $s \in \mathbb{R}$, we denote
by $\mathcal{M}_{r,s}$ the random point measure on the skeleton of $\mathcal{T}$ induced by the marks present at time
$s$. It is immediate to check that for each fixed $s$, conditionally on $\mathcal{T}$, $\mathcal{M}_{r,s}$ is a Poisson point
measure on the skeleton of $\mathcal{T}$ with intensity $r\lambda$.
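
The value $r$ of the stationary intensity can be checked by a one-line computation. In the sketch below, $m_s$ is a name introduced here for the mean number of marks present at time $s$ per unit of $\lambda$-length; a mark born at time $s' < s$ is still present at time $s$ exactly when its lifetime exceeds $s - s'$.

```latex
m_s \;=\; \int_{-\infty}^{s} \frac{1}{2}\,
          \mathbb{P}\bigl(\text{lifetime} > s - s'\bigr)\, ds'
    \;=\; \frac{1}{2} \int_{0}^{\infty} e^{-v/(2r)}\, dv
    \;=\; \frac{1}{2} \cdot 2r
    \;=\; r .
```

The Poisson property itself is preserved because restricting a Poisson point process to the atoms alive at time $s$ and projecting them onto the skeleton again yields a Poisson point measure.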

Following Aldous and Pitman [2], for every $s \in \mathbb{R}$ we use the atoms of the Poisson random
measure $\mathcal{M}_{r,s}$ to decompose the set of leaves of $\mathcal{T}$. More precisely, we decide that two leaves
belong to the same component if and only if $\mathcal{M}_{r,s}$ has no mass on the path that joins those
leaves (note that the probability of this event is $\exp(-r\ell)$ where $\ell$ is the length of the path
between those two leaves). The components are closed in the subset of leaves, and we denote
by $Y_{r,s}$ the sequence of their $\mu$-masses ranked in the decreasing order. We stress that for every
$s \in \mathbb{R}$, $Y_{r,s}$ has the same distribution as $F_r$, the CRT mass-fragmentation evaluated at time
$r$, which was mentioned in the Introduction and is described in Theorem 4 of [2].

Recall from the Introduction the construction of the fragmentation-coagulation chain $X^{(n)}$.
We are now able to state

Theorem 1 For each fixed $t > 0$, the fragmentation-coagulation process

$$\left( X^{(n)}_{\lfloor tn + s\sqrt{n} \rfloor} \right)_{s \in \mathbb{R}}$$

converges weakly in the sense of finite dimensional distributions as $n \to \infty$ towards the mixed
process

$$\left( Y_{R_t, s} \right)_{s \in \mathbb{R}},$$

where $R_t$ denotes a random variable on $(0, \infty)$ which is independent of the preceding processes
and has the distribution of a reflected Brownian motion at time $t$, viz.

$$\mathbb{P}(R_t \in dr) = \sqrt{\frac{2}{\pi t}} \exp\left( -\frac{r^2}{2t} \right) dr, \qquad r > 0.$$

The rest of this section is devoted to the proof of Theorem 1; the scheme of the argument is
adapted from Aldous and Pitman [2]. We first recall the formulation of the convergence of
uniform random trees towards the CRT via reduced trees.

Given $\mathcal{T}$, we sample a sequence $U_1, U_2, \ldots$ of i.i.d. random leaves according to the law $\mu$, and
for every integer $i \ge 1$, we denote by $\mathcal{R}(\infty, i)$ the subtree reduced to the first $i$ leaves, i.e.
the smallest connected subset of $\mathcal{T}$ containing $U_1, \ldots, U_i$. The reduced tree $\mathcal{R}(\infty, i)$ is a
combinatorial tree (simple graph with no cycles) with leaves labeled by $1, \ldots, i$ and some
unlabeled internal nodes. The paths between two adjacent internal nodes or between a leaf
and an adjacent internal node are called edges. The lengths of edges are given by the length
measure $\lambda$ on $\mathcal{T}$, and the joint distribution of the shape and the edge-lengths is described by
Lemma 21 in [1].

For every integer $n \ge 2$, we also consider a uniform random tree $T_n$ on a set of $n$ vertices, say
$\{1, \ldots, n\}$, and assign length $1/\sqrt{n}$ to every edge. For every $1 \le i \le n$, we denote by $\mathcal{R}(n, i)$
the sub-tree reduced to the first $i$ vertices. We agree that internal nodes with degree 2 are
discarded, so that the edge-length between two adjacent vertices in $\mathcal{R}(n, i)$ is $(1 + j)/\sqrt{n}$ with
$j$ the number of internal nodes with degree 2 lying on the path connecting these vertices. It
has been shown by Aldous (see (49) in [1] or Lemma 9 in [2]) that for every fixed $i$,

$$\mathcal{R}(n, i) \Longrightarrow \mathcal{R}(\infty, i) \quad \text{as } n \to \infty, \qquad (1)$$

in the sense of weak convergence of the joint distributions of shape and edge lengths. 

We then add and erase marks randomly on $T_n$ as explained in the Introduction. For each
integer $k \ge 0$, we denote by $\mathcal{M}_k^{(n)}$ a random point measure on $T_n$ that assigns a unit mass
to each marked edge after $k$ steps, and for every $i \le n$ by $\mathcal{M}_k^{(n,i)}$ the restriction of $\mathcal{M}_k^{(n)}$ to
$\mathcal{R}(n, i)$. Similarly, we also denote by $\mathcal{M}_{r,s}^{(i)}$ the restriction of the Poisson point measure $\mathcal{M}_{r,s}$
to the reduced tree $\mathcal{R}(\infty, i)$. The key to Theorem 1 lies in the following limit theorem which
can be viewed as a multi-dimensional extension of Equation (18) in [2]. Essentially it is a
consequence of the law of rare events combined with the convergence of reduced trees. Recall
that the reduced trees are defined by their shapes and edge-lengths, and that the set of shapes
of trees with $i$ vertices is finite. The reduced trees $\mathcal{R}(n, i)$ and $\mathcal{R}(\infty, i)$ should thus be viewed
as random variables with values in some Polish space, and weak convergence of random point
measures should be understood in this setting.



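Lemma 1 Fix $t > 0$ and an integer $i$. The process of random point measures on the reduced
trees

$$\left( \mathcal{M}^{(n,i)}_{\lfloor tn + s\sqrt{n} \rfloor} \right)_{s \in \mathbb{R}}$$

converges weakly in the sense of finite dimensional distributions as $n \to \infty$ towards the mixed
process of point measures on the reduced CRT

$$\left( \mathcal{M}^{(i)}_{R_t, s} \right)_{s \in \mathbb{R}},$$

where $R_t$ denotes a random variable on $(0, \infty)$ which is independent of the preceding processes
and has the distribution of a reflected Brownian motion at time $t$, viz.

$$\mathbb{P}(R_t \in dr) = \sqrt{\frac{2}{\pi t}} \exp\left( -\frac{r^2}{2t} \right) dr, \qquad r > 0.$$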
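
As a quick numerical sanity check on the law of the mixing variable, the sketch below integrates the density of $|\mathcal{N}(0, t)|$ with a plain trapezoid rule and compares its mean with the known value $\sqrt{2t/\pi}$. The helper names, the value $t = 2$ and the truncation of the integral at 12 standard deviations are choices made here for illustration only.

```python
import math


def reflected_bm_density(r, t):
    """Density at r > 0 of R_t = |N(0, t)|: sqrt(2 / (pi t)) * exp(-r^2 / (2 t))."""
    return math.sqrt(2.0 / (math.pi * t)) * math.exp(-r * r / (2.0 * t))


def trapezoid(f, a, b, m):
    """Composite trapezoid rule for f on [a, b] with m sub-intervals."""
    h = (b - a) / m
    total = 0.5 * (f(a) + f(b))
    for j in range(1, m):
        total += f(a + j * h)
    return total * h


t = 2.0
upper = 12.0 * math.sqrt(t)  # the Gaussian tail beyond this point is negligible
mass = trapezoid(lambda r: reflected_bm_density(r, t), 0.0, upper, 20000)
mean = trapezoid(lambda r: r * reflected_bm_density(r, t), 0.0, upper, 20000)
print(mass, mean, math.sqrt(2.0 * t / math.pi))
```

The total mass should come out numerically equal to 1 and the mean equal to $\sqrt{2t/\pi}$, which also makes visible that the typical size of $R_t$ grows like $\sqrt{t}$.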



Proof: We first deal with the one-dimensional convergence in the statement, rephrasing (and
slightly developing) the argument for Equation (18) in [2].

By Skorokhod's representation, we may assume that the convergence (1) for the reduced trees
holds almost surely and not merely in distribution. Thus with a probability close to 1 when
$n$ is large, the shape of $\mathcal{R}(n, i)$ coincides with that of $\mathcal{R}(\infty, i)$, and the edge lengths of $\mathcal{R}(n, i)$
and of $\mathcal{R}(\infty, i)$ are close. We denote by $N_k^{(n)}$ the total number of marks on $T_n$ after $k$ steps
and consider a sequence $(r_n)_{n \in \mathbb{N}}$ of integers with $r_n \sim r\sqrt{n}$ for some $r > 0$. We first work for
each $n$ conditionally on the event that $N^{(n)}_{\lfloor tn + s\sqrt{n} \rfloor} = r_n$.

Recall that $T_n$ has $n - 1$ edges, each of length $1/\sqrt{n}$, so the number of edges in a segment of $T_n$
is $\sqrt{n}$ times the length of that segment. We stress that when $n$ is large, the number of edges in
the reduced tree $\mathcal{R}(n, i)$ is of order $\sqrt{n} = o(n)$ and the number of marked edges in $\mathcal{R}(n, i)$ of
order $O(1) = o(r_n)$. This is important to justify the claims of asymptotic independence which
will be made below.

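As $(n - 1)^{-1} r_n \sqrt{n} \sim r$, it follows from the law of rare events that when $n$ is large, the number
of marked edges after $\lfloor tn + s\sqrt{n} \rfloor$ steps on a segment in $T_n$ is approximately Poisson distributed
with parameter given by $r$ times the length of that segment, and further to disjoint segments
correspond asymptotically independent Poisson variables. This entails that the conditional
distribution of $\mathcal{M}^{(n,i)}_{\lfloor tn + s\sqrt{n} \rfloor}$ converges weakly as $n \to \infty$ towards a Poisson random measure on
$\mathcal{R}(\infty, i)$ with intensity $r\lambda$, i.e.

$$\mathcal{L}\left( \mathcal{M}^{(n,i)}_{\lfloor tn + s\sqrt{n} \rfloor} \,\Big|\, N^{(n)}_{\lfloor tn + s\sqrt{n} \rfloor} = r_n \right) \Longrightarrow \mathcal{L}\left( \mathcal{M}^{(i)}_{r,s} \right), \qquad (2)$$

where the notation $\mathcal{L}(Z \mid A)$ refers to the conditional law of the variable $Z$ given the event $A$.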

We next present the main lines of the argument for extending (2) to multi-dimensional
convergence by analyzing the evolution of the random point measures as $s$ increases. It is readily
checked that with probability one

$$N^{(n)}_{\lfloor tn + (s + s')\sqrt{n} \rfloor} \sim r_n \sim r\sqrt{n}$$

uniformly for $s' \ge 0$ in an arbitrary bounded interval. Thus for every $k = nt + O(\sqrt{n})$, each
atom of $\mathcal{M}_k^{(n)}$ has a probability close to $1/(2r_n) \sim 1/(2r\sqrt{n})$ to be erased at the next step,
where the factor 1/2 accounts for the probability that tails comes up when the fair coin is
flipped. The probability that a given atom of $\mathcal{M}^{(n,i)}_{\lfloor tn + s\sqrt{n} \rfloor}$ has not been erased after $\lfloor s'\sqrt{n} \rfloor$
further steps is close to

$$\left( 1 - \frac{1}{2r_n} \right)^{\lfloor s'\sqrt{n} \rfloor} \sim \exp\left( -\frac{s'}{2r} \right)$$

when $n$ is large; in other words if one unit of time corresponds to $\sqrt{n}$ steps, each atom of
$\mathcal{M}^{(n,i)}_{\lfloor tn + s\sqrt{n} \rfloor}$ is removed after a time which is approximately exponentially distributed with
mean $2r$. A similar argument shows that asymptotically, each atom of $\mathcal{M}^{(n,i)}_{\lfloor tn + s\sqrt{n} \rfloor}$ is removed
or not after $\lfloor s'\sqrt{n} \rfloor$ more steps independently of the other atoms.
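
The exponential approximation for the survival probability of a mark can be checked numerically. The sketch below is an illustration only: the function name is hypothetical, $r_n$ is taken to be $r\sqrt{n}$ rounded to the nearest integer, and the number of marks is treated as frozen at $r_n$ during the $\lfloor s'\sqrt{n} \rfloor$ steps, which is the simplification used in the argument above.

```python
import math


def survival_probability(n, r, s_prime):
    """(1 - 1/(2 r_n))^floor(s' sqrt n), with r_n = round(r sqrt n) held fixed (assumption)."""
    r_n = max(1, round(r * math.sqrt(n)))
    steps = math.floor(s_prime * math.sqrt(n))
    return (1.0 - 1.0 / (2.0 * r_n)) ** steps


r, s_prime = 1.5, 2.0
limit = math.exp(-s_prime / (2.0 * r))
for n in (10**2, 10**4, 10**6, 10**8):
    # the finite-n value should approach exp(-s'/(2r)) as n grows
    print(n, survival_probability(n, r, s_prime), limit)
```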

On the other hand, at each step, a mark appears on a given un-marked edge with probability close
to $1/(2(n - r_n)) \sim 1/(2n)$. Recalling that any given mark is also erased at each step with
probability close to $1/(2r\sqrt{n})$, and neglecting the event of multiple appearances and erasures
of a mark, whose probability is of lower order, we deduce that the probability that an edge


260 



Electronic Communications in Probability 



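with no mark after $\lfloor tn + s\sqrt{n} \rfloor$ steps is marked after $\lfloor s'\sqrt{n} \rfloor$ further steps is

$$\frac{1}{2n} \sum_{j=0}^{\lfloor s'\sqrt{n} \rfloor - 1} \left( 1 - \frac{1}{2r\sqrt{n}} \right)^{j} \sim \frac{r}{\sqrt{n}} \left( 1 - \exp\left( -\frac{s'}{2r} \right) \right).$$

It then follows from the law of rare events that for any given segment of $\mathcal{R}(n, i)$ with length $\ell$
(i.e. with $\ell\sqrt{n}$ edges), the number of marked edges after $\lfloor tn + (s + s')\sqrt{n} \rfloor$ steps which were
un-marked after $\lfloor tn + s\sqrt{n} \rfloor$ steps is approximately Poisson with parameter

$$r\ell \left( 1 - \exp\left( -\frac{s'}{2r} \right) \right).$$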

Further, one checks readily that the evolutions of marks on a given finite sequence of disjoint 
segments are asymptotically independent. 
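
The Poisson parameter for the newly marked edges can be recovered by summing a geometric series. In the worked computation below, $p$ and $m$ are shorthand introduced here: $p = 1/(2r\sqrt{n})$ is the per-step erasure probability of a mark and $m = \lfloor s'\sqrt{n} \rfloor$ is the number of further steps; a mark that appears at some step counts only if it survives the remaining steps.

```latex
\begin{align*}
\frac{1}{2n} \sum_{j=0}^{m-1} (1-p)^{j}
  &= \frac{1}{2n} \cdot \frac{1 - (1-p)^{m}}{p}
   = \frac{r}{\sqrt{n}} \Bigl( 1 - (1-p)^{m} \Bigr)
   \;\sim\; \frac{r}{\sqrt{n}} \Bigl( 1 - e^{-s'/(2r)} \Bigr), \\
\ell\sqrt{n} \cdot \frac{r}{\sqrt{n}} \Bigl( 1 - e^{-s'/(2r)} \Bigr)
  &= r\ell \Bigl( 1 - e^{-s'/(2r)} \Bigr), \\
\underbrace{r\ell\, e^{-s'/(2r)}}_{\text{surviving marks}}
  + \underbrace{r\ell \bigl( 1 - e^{-s'/(2r)} \bigr)}_{\text{new marks}}
  &= r\ell .
\end{align*}
```

The last line is the consistency check behind the stationarity: on a segment of length $\ell$, the mean number of marks that survive plus the mean number of marks that appear equals the original mean $r\ell$.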

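Putting the pieces together, this shows that when $n$ is large, the distribution of
$\mathcal{M}^{(n,i)}_{\lfloor tn + (s + s')\sqrt{n} \rfloor}$ given $\mathcal{M}^{(n,i)}_{\lfloor tn + s\sqrt{n} \rfloor}$ is close to that of a measure obtained from
$\mathcal{M}^{(n,i)}_{\lfloor tn + s\sqrt{n} \rfloor}$ by removing each atom with probability $1 - \exp(-s'/(2r))$ independently of the
others (i.e. by thinning), and further adding an independent Poisson measure on $\mathcal{R}(n, i)$ with
intensity $r(1 - \exp(-s'/(2r)))\lambda$. Comparing with the evolution of the random point measure
$\mathcal{M}^{(i)}_{r,s}$ when $s$ increases, we see that (2) can be extended as follows: for every $s' \ge 0$ we have

$$\mathcal{L}\left( \left( \mathcal{M}^{(n,i)}_{\lfloor tn + s\sqrt{n} \rfloor}, \mathcal{M}^{(n,i)}_{\lfloor tn + (s + s')\sqrt{n} \rfloor} \right) \,\Big|\, N^{(n)}_{\lfloor tn + s\sqrt{n} \rfloor} = r_n \right) \Longrightarrow \mathcal{L}\left( \mathcal{M}^{(i)}_{r,s}, \mathcal{M}^{(i)}_{r,s+s'} \right).$$

An iteration based on the obvious Markov property of the processes of the random measures
involved (in the case of $(\mathcal{M}^{(i)}_{r,s})_{s \in \mathbb{R}}$, this Markov property follows from the absence of memory
of the exponential variables that specify the lifetime of the marks) enables us to conclude that

$$\mathcal{L}\left( \left( \mathcal{M}^{(n,i)}_{\lfloor tn + s'\sqrt{n} \rfloor} \right)_{s' \ge s} \,\Big|\, N^{(n)}_{\lfloor tn + s\sqrt{n} \rfloor} = r_n \right) \Longrightarrow \mathcal{L}\left( \left( \mathcal{M}^{(i)}_{r,s'} \right)_{s' \ge s} \right) \qquad (3)$$

in the sense of finite dimensional distributions.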

All that we need now is to get rid of the conditioning in (3), which is straightforward. Indeed
Donsker's invariance principle shows that there is the weak convergence

$$\frac{1}{\sqrt{n}} N^{(n)}_{\lfloor tn + s\sqrt{n} \rfloor} \Longrightarrow R_t,$$

where $R_t$ is a random variable distributed as in the statement. On the other hand, it is easily
seen from the construction of the random point measures $\mathcal{M}_{r,s}$ that the finite-dimensional
distributions of the process $(\mathcal{M}^{(i)}_{r,s'})_{s' \ge s}$ depend continuously on the parameter $r$. We derive
from above that

$$\left( \mathcal{M}^{(n,i)}_{\lfloor tn + s'\sqrt{n} \rfloor} \right)_{s' \ge s} \Longrightarrow \left( \mathcal{M}^{(i)}_{R_t, s'} \right)_{s' \ge s}$$

in the sense of finite dimensional distributions, which is our claim. □

We now explain how Theorem 1 follows from Lemma 1, focussing on one-dimensional
distributions as the multi-dimensional case is similar but with heavier notation.

For all integers $n, k \ge 1$, we denote by $\Pi_k^{(n)}$ the random partition of $\{1, \ldots, n\}$ induced by
the marks on edges of $T_n$ after $k$ steps, that is the blocks of $\Pi_k^{(n)}$ are characterized by the
property that there is no mark on the paths connecting two vertices in the same block. By
definition, $X_k^{(n)}$ is the sequence of the sizes of the blocks of $\Pi_k^{(n)}$ ranked in the decreasing order
and rescaled by a factor $1/n$.

Similarly, for every $s \in \mathbb{R}$, we denote by $\Pi_{r,s}$ the random partition of $\mathbb{N}$ such that two integers
$j, j'$ belong to the same block of $\Pi_{r,s}$ if and only if there is no atom of the independent mixed-
Poisson random measure $\mathcal{M}_{r,s}$ on the path in $\mathcal{T}$ from the leaf $U_j$ to the leaf $U_{j'}$. Plainly the
random partition $\Pi_{r,s}$ is exchangeable and the asymptotic frequencies of its blocks are given
by $Y_{r,s}$, i.e. the sequence of the $\mu$-masses of the connected components of the CRT $\mathcal{T}$ cut at
the atoms of $\mathcal{M}_{r,s}$.

For every integer $i \le n$, we also denote by $\Pi_k^{(n,i)}$ (respectively by $\Pi_{r,s}^{(i)}$) the restriction of
$\Pi_k^{(n)}$ (respectively of $\Pi_{r,s}$) to the first $i$ vertices. Plainly these restricted partitions only
depend on the reduced trees $\mathcal{R}(n, i)$ and $\mathcal{R}(\infty, i)$, and the marks on their edges after $k$ steps and
the atoms of the random measure $\mathcal{M}_{r,s}^{(i)}$, respectively. Lemma 1 implies that for every $i$, in the
obvious notation, when $n \to \infty$ there is the weak convergence

$$\Pi^{(n,i)}_{\lfloor tn + s\sqrt{n} \rfloor} \Longrightarrow \Pi^{(i)}_{R_t, s}. \qquad (4)$$

Repeating the argument of Aldous and Pitman for proving Theorem 3 in [2] enables us to
conclude from (4) and the preceding observations that

$$X^{(n)}_{\lfloor tn + s\sqrt{n} \rfloor} \Longrightarrow Y_{R_t, s},$$

which is the one-dimensional version of Theorem 1. The multidimensional case is similar, using
the full strength of Lemma 1.



3 A comment, a complement, and an open question 

The stationary limiting process $(Y_{R_t, s})_{s \in \mathbb{R}}$ which appears in Theorem 1 is expressed as a
mixture. The mixing variable $R_t$ may be thought of as the effective age of the system as it
represents the intensity of cuts along the skeleton of the CRT. In this direction, we note that
the variables $R_t$ are stochastically increasing with $t$. We also mention that $R_t$ can be recovered
from a sample of $Y_{R_t, s}$. Indeed, it follows easily from Theorem 4 in Aldous and Pitman [2]
and the law of large numbers for Poisson processes that if $F_t(i)$ denotes the $i$-th largest term
of $F_t$, the CRT fragmentation of masses observed at time $t$, then with probability one

$$F_t(i) \sim \frac{2 t^2}{\pi i^2} \quad \text{as } i \to \infty.$$

Equivalently

$$Y_{r,s}(i) \sim \frac{2 r^2}{\pi i^2} \quad \text{as } i \to \infty,$$

where $Y_{r,s}(i)$ denotes the $i$-th largest term of $Y_{r,s}$, and we conclude that

$$R_t = \lim_{i \to \infty} i \sqrt{\frac{\pi}{2}\, Y_{R_t, s}(i)}.$$
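
One way to see where the constant $2/\pi$ comes from is the following heuristic sketch. It assumes the description of $F_t$ in Theorem 4 of [2] as the ranked jumps over $[0, t]$ of a stable(1/2) subordinator conditioned to have sum 1; the normalization $(2\pi)^{-1/2} x^{-3/2}\,dx$ of the Lévy measure is stated here as an assumption, and $\bar{\nu}$ is notation introduced for this sketch only.

```latex
\bar{\nu}(x) \;=\; \int_{x}^{\infty} \frac{dy}{\sqrt{2\pi}\, y^{3/2}}
            \;=\; \sqrt{\frac{2}{\pi x}},
\qquad
t\, \bar{\nu}\bigl( F_t(i) \bigr) \sim i
\;\Longrightarrow\;
F_t(i) \sim \frac{2 t^2}{\pi i^2} \quad (i \to \infty).
```

The middle relation is the law of large numbers for Poisson processes: the number of jumps larger than $x$ over $[0, t]$ is Poisson with mean $t\bar{\nu}(x)$, so the $i$-th largest jump sits where this mean is about $i$.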



We used uniform random trees $T_n$ merely to stick to Pitman's original framework [9].
Nonetheless the same results hold if we replace $T_n$ by any other sequence of discrete random trees
which converges to the Brownian CRT after rescaling edge lengths by a factor $1/\sqrt{n}$ and masses of
vertices by a factor $1/n$ (for instance critical Galton-Watson trees with finite variance and
conditioned to have total size $n$). In the same vein, the results of this note can be extended
to certain sequences of so-called birthday trees. Indeed, Camarri and Pitman [5] have
established the weak convergence of suitably rescaled birthday trees towards certain Inhomogeneous
Continuum Random Trees. On the other hand, the dynamics of edge-deletion for birthday trees
bear the same connection to the additive coalescence as for uniform random trees, except that
the initial distribution of masses is inhomogeneous. The asymptotic behavior of the latter has
been characterized by Aldous and Pitman [3], in the study of the entrance boundary of the
additive coalescence. We thus have all the ingredients needed to apply the arguments of the
present work to this more general setting. Of course, the limiting processes will then have
different distributions.

Our aim in this work was to point at the phenomenon of subaging in a fragmentation-
coagulation process. The model that we used for this purpose is easy to deal with although
somewhat artificial. There are other discrete models for the evolution of random forests which
may be more natural, but are also much harder to investigate. Here is an example, which is
inspired by the subtree prune and regraft algorithm; see Chapter 9 of Evans [6]. We now work
with rooted forests on $n$ vertices, that is each tree has one distinguished vertex that serves as
the root. At each step we flip a fair coin. With probability 1/2 we delete an edge chosen
uniformly at random in this forest. This disconnects the tree containing that edge into two rooted
subtrees. With probability 1/2, we create a new edge between a vertex chosen uniformly at
random and the root of a tree chosen uniformly at random amongst the trees to which the
chosen vertex does not belong. Our result suggests that a similar subaging phenomenon might
occur at the same scale as in the present study. Proving or disproving this property would be
interesting, but does not seem easy.
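
For readers who wish to experiment with this open question, the dynamic can be sketched in a few lines. Several choices below are assumptions not fixed by the description above: the forest is stored as an array of parent pointers, the chain starts from $n$ isolated vertices, a move that cannot be performed (no edge to delete, or no other tree to attach) is skipped, and the function names are hypothetical.

```python
import random


def root_of(parent, v):
    """Follow parent pointers from v up to the root of its tree."""
    while parent[v] is not None:
        v = parent[v]
    return v


def step(parent, rng):
    """One move on a rooted forest stored as parent pointers (None marks a root)."""
    n = len(parent)
    if rng.random() < 0.5:
        # Edges correspond to non-root vertices, so a uniform edge is a uniform non-root.
        non_roots = [v for v in range(n) if parent[v] is not None]
        if non_roots:
            parent[rng.choice(non_roots)] = None  # the cut-off subtree is rooted at v
    else:
        v = rng.randrange(n)
        own_root = root_of(parent, v)
        other_roots = [u for u in range(n) if parent[u] is None and u != own_root]
        if other_roots:
            parent[rng.choice(other_roots)] = v  # graft a uniformly chosen other tree at v


def tree_sizes(parent):
    """Sizes of the trees of the forest, ranked in decreasing order."""
    sizes = {}
    for v in range(len(parent)):
        root = root_of(parent, v)
        sizes[root] = sizes.get(root, 0) + 1
    return sorted(sizes.values(), reverse=True)


if __name__ == "__main__":
    rng = random.Random(7)
    forest = [None] * 1000  # assumption: start from isolated vertices
    for _ in range(5000):
        step(forest, rng)
    print(tree_sizes(forest)[:10])
```

Recording `tree_sizes` along the trajectory at times of the form $tn + s\sqrt{n}$ would be one way to look for the pseudo-stationarity discussed in this note, although nothing here indicates what the outcome should be.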